A regression framework for assessing covariate effects on the reproducibility of high-throughput experiments.

نویسندگان

  • Qunhua Li
  • Feipeng Zhang
چکیده

The outcome of high-throughput biological experiments is affected by many operational factors in the experimental and data-analytical procedures. Understanding how these factors affect the reproducibility of the outcome is critical for establishing workflows that produce replicable discoveries. In this article, we propose a regression framework, based on a novel cumulative link model, to assess the covariate effects of operational factors on the reproducibility of findings from high-throughput experiments. In contrast to existing graphical approaches, our method allows one to succinctly characterize the simultaneous and independent effects of covariates on reproducibility and to compare reproducibility while controlling for potential confounding variables. We also establish a connection between our model and certain Archimedean copula models. This connection not only offers our regression framework an interpretation in copula models, but also provides guidance on choosing the functional forms of the regression. Furthermore, it also opens a new way to interpret and utilize these copulas in the context of reproducibility. Using simulations, we show that our method produces calibrated type I error and is more powerful in detecting difference in reproducibility than existing measures of agreement. We illustrate the usefulness of our method using a ChIP-seq study and a microarray study.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

DNA Sequence Fragment Containing C to A Mutation as a Convenient Mutation Standard for DHPLC Analysis

Objective(s):  Denaturing high performance liquid chromatography (DHPLC) is a high throughput approach for screening DNA sequence variations. To assess oven calibration, cartridge performance, buffer composition and stability, the WAVE Low and High Range Mutation Standards are employed to ensure reproducibility and accuracy of the chromatographic analysis. The purpose of this study was to provi...

متن کامل

Measuring Reproductibility of High-Throughput Biological Experiments

Reproducibility is essential to reliable scientific discovery in large-scale high-throughput biological studies. In this talk, I will present a unified approach to measure reproducibility of findings identified from replicate experiments and select discoveries using reproducibility between replicates. Unlike the usual scalar measures of reproducibility, our approach views reproducibility as whe...

متن کامل

Nootropic Medicinal Plants; Evaluating Potent Formulation By Novelestic High throughput Pharmacological Screening (HTPS) Method

The principle of this method was to screen the pharmacological activity of six prepared polyphyto formulations by using high throughput screening method for their nootropic action. The study was performed in three stages using one, two and three animals, respectively in a group. Test formulations were given p.o daily at the dose of 50 and 100 mg/kg body weight. The test formulations were compar...

متن کامل

Combining machine learning and matching techniques to improve causal inference in program evaluation.

RATIONALE, AIMS AND OBJECTIVES Program evaluations often utilize various matching approaches to emulate the randomization process for group assignment in experimental studies. Typically, the matching strategy is implemented, and then covariate balance is assessed before estimating treatment effects. This paper introduces a novel analytic framework utilizing a machine learning algorithm called o...

متن کامل

Size Reproducibility of Gadolinium Oxide Based Nanomagnetic Particles for Cellular Magnetic Resonance Imaging: Effects of Functionalization, Chemisorption and Reaction Conditions

We developed biofunctionalized nanoparticles with magnetic properties by immobalizing diethyle-neglycol (DEG) on Gd2O3, and PEGilation of small particulate gadolinium oxide (SPGO) with two me-thoxy-polyethyleneglycol-silane (mPEG-Silane 550 and 2000 Da) using a new supervised polyol route, described recently. In conjunction to the previous study to achieve a high quality synthesis and increase ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Biometrics

دوره   شماره 

صفحات  -

تاریخ انتشار 2017